BIOTECHNOLOGY afl e^-O- 



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 

Source: Olf^ j / 

Date Processed by STIC: / /3o/2o/>'3^ 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS, 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gOV or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.1 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http;//www,uspto,gov/web/offices/pac/checker 

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there 

a possibility that the disk/CD-Rom may'have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the 

United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www,uspto.gov/ebc/efs/downloads/documents>htm> , EFS Submission 
User Manual - ePAVE) 

2. U.S, Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7* Floor, Examiner Name, 
Sequence Information, Crystal Mall One, 191 1 South Clark Street, Arlington, VA 22202 
Or 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two, 
20 1 1 South Clark Place, Arlington, VA 22202 

4. Federal Express, United Parcei'Service, or other delivery service to: U.S. Patent and Trademark Office, 
Box Sequefice, Room IBOS-Maihoom, Crystal Plaza Two, 201 1 South Clark Place, Arlington, VA 22202 



Revised 01/29/2002 



Raw Sequence Ustine Error Summaiy 



ERROR DETECTED SUGGESTED CORRECTION SERIAL NUMBER: 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH -ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 



1 Wrapped Nucleics 

Wrapped Aminof 



The ntjmberAcxt at the end of each line -wrapped^ down to the next line. ThU may occur if your file 
was retrieved in a word processor after creating iL Please adjust your right margin to J; this will 
prevent ••wrapping.*' 



2 Invalid Line Length The rules require that a line not exceed 72 charadcn in length. This includes white spaces. 



3 \^ Misaligned Amino 

Numbering 

4 Non>ASCIl 



The numbering under each 5* amino acid is misaligned. Do not use tab codes between numbers; 
use space chanctera^ insteadL 

n^c nibmitted file was not saved in ASCn(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission b saved In ASCII text 



3 Variable Length Scqucnce<j) ^contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

- _ — — I - — n* — ..^ ij^c TTisxijiTuin number o* cat** 



each n or Xaa can oiily represehi a smgie resiauc. rica^ pfcscm 



6 Patentin 2.0 

iHig- 



^Skipped Sequences 



(OLD RULES) 



8 Skipped Sequences 

• (NEW RULES) 



Use of n*s or Xaa*s 



10 



13 



(NEW RULES) 

_lnvalid<213> 
Response 

Use of <220> 



Patentin 2.0 



residue having variable length and indicate in the <220>-<223> scctioiHhat some may be missing. 

A *'bug- in Patentin version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequences(s) . Normally, Patentin would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence^ This applies to the mandatory <220>-<223> sections for 
Artlfldal or Unknown sequences.' 

Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X*' is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: (insert SEQ ID NO where -X^Js showh) 
This sequence is intentionally skip|>ed 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to Include the skipped sequences. 

Sequence(s) missing. If Intentional, please insert the following lines for each skipped sequence. 

<2 10> sequence id number 
<400> sequence id number 
000 

Use of n*» and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>^23> is MANDATORY if n's or Xaa's arc present 

In <220> to <223> section, please explain k>cation of n or Xaa; and which residue n or Xaa represents. 

Per 1.823 of Sequence fGHIes; the only vaMd <213> responses are: Unknown. Artiiicial Sequence, or 
scientific name (Genus/species> <220X223> section is required when <213> response is Unknown or 
is Artificial Sequence . . • ,« 

Sequence(s) missing the <220> "Feature** and associated numeric identifiers and responses. 

Ui^ of <220> to <223> U MANDATORY if <2 13> •X>ganism'' response is -Artificial Sequence" or 

-Unknown.- Please explain source of genetic material in <220> to <223> section. 

(See -Federal Register.- 06«)l/1998. Vol 63. Na 104, pp. 29631-32) (Sec 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of PatenUn venioo 2.0. This causes a corrupted file, 
resuhing in nuuing mandatory numeric identifiers and responses (as iftdicated .on raw sequence . . 

-listing). Instead. Incase use -File Manager or aiiy other manual means to'copy file to floppy di^ 



13 



Misuse of n 



n can only be used to represent a single nucleotide in a nucleic acid sequence. N is noi used to represent 
any vahie not spedAcaUy a mdeottde. 

AMG/MH- Biotechnology Systems Brucfa - 08/21/2001 




OIPE 




RAW SEQUENCE LISTING DATE: 01/30/2002 

PATENT APPLICATION: US/10/042,091 TIME: 15:27:08 . 

Input Set : A:\ES.txt. 

Output Set: N:\CRF3\01302002\J042091.raw ^ 

Does Not Compiv 

2 <110> APPLICANT: Andrade- Gordon, Patricia ConBCted Diskette Needed 

4 Darrow, Andrew 

6 Qi, Jensen 

7^'<120> TITLE OF INVENTION: DNA Encoding Human Serine Protease EOS 

<130> FILE REFERENCE: ORT-1031 
9 <140> CURRENT APPLICATION NUMBER: 
10 <141> CURRENT FILING DATE: 2002-01-08 
■-> 11 <160> NUMBER OF SEQ ID: 7 

12 <170> SOFTWARE: Patentin Ver. 2.0 ^ ' 

ERRORED SEQUENCES 

90 <210> SEQ ID NO: 7 

91 <211> LENGTH: 284 . n 

92 <212> TYPE: PRT OJ ^ 

93 <213> ORGANISM: Homo sapiens ' 
W--> 94 <400> SEQUENCE: 7 

95 Met Arg Gly Val Ser Cys Leu Gin Val Leu Leu Leu Leu Val Leu Gly 

96 1 5-^ 10-^ 15r> \ 

97 Ala Ala Gly Thr Gin Gly Arg Lys Ser Ala Ala Cys Gly Gin Pro Arg ^ \ 0 

E--> 98-^ aa, Zo /^yyy.^^x€JL^^^ 



E- 


-> 


100 


35 










40 










45 
















101 


Pro 


Trp 


Gin 


Ala 


Ser 


He 


Gin 


His 


Pro 


Gly 


Ala 


His 


Val 


Cys 


Gly 


Gly 


E- 


-> 


102 


50 










55 










60 
















103 


Ser 


Leu 


He 


Ala 


Pro 


Gin 


Trp 


Val 


Leu 


Thr 


Ala 


Ala 


His 


Cys 


Phe 


Pro 


E- 


-> 


104 


65 










70 










75 










80 






105 


Arg 


Arg 


Ala 


Leu 


Pro 


Ala 


Glu 


Tyr 


Arg 


Val 


Arg 


Leu 


Gly 


Ala 


Leu 


Arg 


E- 


-> 


106 


85 










90 










95 
















107 


Leu 


Gly 


Ser 


Thr 


Ser 


Pro 


Arg 


Thr 


Leu 


Ser 


Val 


Pro 


Val 


Arg 


Arg 


Val 


E- 


-> 


108 


100 










105 










110 
















109 


Leu 


Leu 


Pro 


Pro 


Asp 


Tyr 


Ser 


Glu 


Asp 


Gly 


Ala 


Arg 


Gly 


Asp 


Leu 


Ala 


E- 


-> 


110 


115 










120 










125 
















111 


Leu 


Leu 


Gin 


Leu 


Arg 


Arg 


Pro 


Val 


Pro 


Leu 


Ser 


Ala 


Arg 


Val 


Gin 


Pro 


E- 


-> 


112 


130 










135 










140 
















113 


Val 


Cys 


Leu 


Pro 


Val 


Pro 


Gly 


Ala 


Arg 


Pro 


Pro 


Pro 


Gly 


Thr 


Pro 


Cys 


E- 


-> 


114 


145 










150 










155 










160 






115 


Arg 


Val 


Thr 


Gly 


Trp 


Gly 


Ser 


Leu 


Arg 


Pro 


Gly 


val 


Pro 


Leu 


Pro 


Glu 


E- 


-> 


116 


165 










170 










175 
















117 


Trp 


Arg 


Pro 


Leu 


Gin 


Gly 


Val 


Arg 


Val 


Pro 


Leu 


Leu 


Asp 


Ser 


Arg 


Thr 


E- 


-> 


118 


180 










185 










190 
















119 


Cys 


Asp 


Gly 


Leu 


Tyr 


His 


Val 


Gly 


Ala 


Asp 


val 


Pro 


Gin 


Ala 


Glu 


Arg 



i 



file://C:\CRF3\Outhold\VsrJ042091.htm 
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^ Page 2 of 4 



RAW SEQUENCE LISTING DATE: 01/30/2002 

' PATENT APPLICATION: US/10/042,091 TIME: 15:27:09 

Input Set : A:\ES.txt 

Output Set: N:\CRF3\01302002\J042091.raw 

E--> 120 195 200 205 

121 lie Val Leu Pro Gly Ser Leu Cys Ala Gly Tyr Pro Gin Gly His Lys 
E--> 122 210 215 220 

123 Asp Ala Cys Gin Gly Asp Ser Gly Gly Pro Leu Thr Cys Leu Gin Ser 
E--> 124 225 230 235 240 

125 Gly Ser Trp Val Leu Val Gly Val Val Ser Trp Gly Lys Gly Cys Ala 
E--> 126 245 250 255 

127 Leu Pro Asn Arg Pro Gly Val Tyr Thr Ser Val Ala Thr Tyr Ser Pro 
£--> 128 260 265 270 

129 Trp lie Gin Ala Arg Val Thr Ser Asn Ala Ser Arg 

E--> 130 27^5 280 

E--> 131 6 - 



135 /ORT-1031 



y2JUi /^^-^ ^^^^ 



file://C:\CRF3\Outhold\VsrJ042091.htm 



1/30/02 



<210> 2 
<211> 20 
<212> DNA 

<213> Artificial Sequence ^ 
<220> ^ ^ ^UajuJ 

<223> Des£xip£ion of Artificial Sequence: ^.T^ y^y^/y^^v^ ajUJJ^ '^"^^^''^^^ 

gagaaagtca gattcacagc 20 ytJ^^^^ ' ' 

<210> 3 C ) 

<211> 20 lyi^^ 
<212> DNA / 
<213> Artificial Sequence ^yQKMyir^i'^^^^ 
<220> 

<223>.Jiesccip^^ion of Artificial Sequence: 
'^ligonucleotid 



JM') 



Hie types of errors shown exist throughout 
the Sequence Listing. Please check subsequent 
sequences for similar errors. 



4 



VERIFICATION SUMMARY DATE: 01/30/2002 

PATENT APPLICATION: US/10/042,091 TIME: 15:27:10 

Input Set : A:\ES.txt 

Output Set: N:\CRF3\01302002\J042091.raw 



L:7 M:283 W: Missing Blank Line separator, <120> field identifier 
L:8 M:283 W: Missing Blank Line separator, <130> field identifier 
L:9 M:283 W: Missing Blank Line separator, <140> field identifier 



L 


10 


M 


271 


C: 


Current 


Filing Date differs. 


Replaced Current Filing 


L 


11 


M 


283 


W: 


Missing 


Blank 


Line 


separator, 


<160> 


field 


identifier 


L 


13 


M 


283 


W: 


Missing 


Blank 


Line 


separator. 


<210> 


field 


identifier 


L 


17 


M- 


283 


W: 


Missing 


Blank 


Line 


separator, 


<400> 


field 


identifier 


L 


49 


M 


283 


W: 


Missing 


Blank 


Line 


separator, 


<220> 


field 


identifier 


L 


52 


M: 


283 


W: 


Missing 


Blank 


Line 


separator, 


<400> 


field 


identifier 


L 


58 


M: 


283 


W: 


Missing 


Blank 


Line 


separator, 


<220> 


field 


identifier 


L 


61 


M 


283 


W: 


Missing 


Blank 


Line 


separator. 


<400> 


field 


identifier 


L 


67 


M 


283 


W: 


Missing 


Blank 


Line 


separator. 


<220> 


field 


identifier 


L 


70 


M. 


283 


W: 


Missing 


Blank 


Line 


separator, 


<400> 


field 


identifier 


L 


76 


M: 


283 


W: 


Missing 


Blank 


Line 


separator. 


<220> 


field 


identifier 


L 


79 


M:283 


W: 


Missing 


Blank 


Line 


separator. 


<400> 


field 


identifier 


L 


85 


M 


283 


W: 


Missing 


Blank 


Line 


separator, 


<220> 


field 


identifier 


L 


88 


M:283 


W: 


Missing 


Blank 


Line 


separator, 


<400>. 


field 


identifier 


L 


94 


M: 


283 


W: 


Missing 


Blank 


Line 


separator,. 


<400> 


field 


identifier 


L 


98 


M 


332 


E: 


(32) Invalid/Missing Amino Acid Numbering, 


SEQ ID: 7 , 



M:332 Repeated in SeqNo=7 
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